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Abstract 

A Wishart matrix is said to be spiked when the underlying covariance matrix has a 
single eigenvalue b different from unity. As b increases through b = 2, a, gap forms from 
the largest eigenvalue to the rest of the spectrum, and with b — 2 of order N~'^/'^ the 
scaled largest eigenvalues form a well defined parameter dependent state. Recent works 
by Bloemendal and Virag [BV], and Mo, have quantified this parameter dependent state 
for real Wishart matrices from different viewpoints, and the former authors have done 
similarly for the spiked Wishart /3-ensemble. We use a recursive structure to give an 
alternative construction of the spiked Wishart /3-ensemble, and we give the exact form 
of the joint eigenvalue PDF for the two matrices in the recurrence. In the case of real 
quaternion Wishart matrices (/3 = 4) the latter is recognised as having appeared in earlier 
studies on symmetrized last passage percolation, allowing the exact form of the scaled 
distribution of the largest eigenvalue to be given. This extends and simplifies earlier work 
of Wang, and is an alternative derivation to a result in [BV]. We also use the construction 
of the spiked Wishart /3-ensemble from [BV] to give a simple derivation of the explicit 
form of the eigenvalue PDF. 

1 Introduction 
1.1 Background 

Very recently, an outstanding problem in random matrix theory has been solved from two 
different perspectives [U [20]. The problem relates to real Wishart matrices, specified as the 
ensemble of random matrices of the form X'^X, where X is an n x real Gaussian matrix 
with distribution proportional to 



N 

exp 



Tr(X^XS~^)). (1.1) 
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The case of interest is when the N x N covariance matrix E is of the spiked form 

S = diag(6,l^-^). (1.2) 
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Here the notation 1^-^ denotes the eigenvalue 1 repeated — 1 times, and the eigenvalue b 
corresponds to the spike. 

With '-f = n/N > 1 and n,N large it was shown by Baik and Silverstein [1] that for 
6 > 1 + the largest eigenvalue separates from the remainder of the spectrum, which is 
otherwise supported on ((1 — 1/7)^, (1 + a/t)^)- A simplified derivation of this fact can be found 
in [61 Prop. 2.4], and some recent generalisations are given in This same effect holds for 
spiked Wishart matrices with complex entries 0. There it has been explicitly demonstrated 
that for large N, and with 6 — (1 + 1/^/7) of order N~^^'^, the scaled largest eigenvalues form 
a parameter dependent state at the onset of the eigenvalue separation. It was conjectured in 
[2] that for these same scaling parameters, spiked real Wishart matrices similarly exhibit a 
parameter dependent state. The outstanding problem has been to quantify this state. 

Before discussing the two recent works which solve this problem, let us say some more 
about the complex case. For this the parameter dependent state was shown in [16] (in the 
case n = — 1), in [2] for general 7 > 1, and in [21] for 7 < 1, to be a determinantal point 
process with correlation kernel independent of 7. Technically, the correlation kernel was shown 
to be a rank one perturbation of the familiar Airy kernel [11]. This explicit form was used in 
turn [Ij to express the distribution of the largest eigenvalue in terms of a member the Lax pair 
for the Hasting-McLeod solution of the Painleve II equation, the latter being specified as the 
transcendent q{s) satisfying 

g" = sg + 2g3, q(s) ~ Ai(s). (1.3) 

s— foo 

As an application, a known correspondence [18] between the eigenvalues of complex Wishart ma- 
trices and last passage times for a directed percolation model based on the Robinson-Schensted- 
Knuth correspondence allowed for an interpretation of these results to be given within a sta- 
tistical mechanics setting. 

The most prominent application of Wishart matrices is to principal component analysis 
in multivariate statistics. There X in (11. ip corresponds to a data matrix for n distinct mea- 
surements of different quantities, and thus has real entries. We know from explicit results 
obtained in the null case for 7 > 1 [19] (the null case refers to S = 1^) that the scaled largest 
eigenvalues now form a Pfaffian rather than determinantal point process. The distribution of 
the scaled largest eigenvalue again involves the Hasting-McLeod solution of the Painleve II 
equation (11. 3p . but is distinct from that in the complex case (see e.g. [TU §9.7]). 

More recently this so called soft edge state, for both the real (/3 = 1) and complex (/3 = 2) 
cases, has further been characterised [Tm [22] in terms of the smallest eigenvalues of the stochastic 
Airy operator 

(P 2 

-—+x + ^B'{x), x>0. (1.4) 

Here B{x) denotes standard Brownian motion and the eigenfunctions are subject to a Dirich- 
let boundary condition at a; = 0. This in turn allows for a diffusion characterisation of the 
distribution of the largest eigenvalue. 

In two recent works — by Bloemendal and Virag [8] and Mo [20] — the problem of quantify- 
ing the soft edge, parameter dependent state for spiked real Wishart matrices has been solved. 
The characterisations are very different, in keeping with the two distinct characterisations of 
the scaled largest eigenvalues revised above in the null case. 
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Consider first the work [8J. With 



77 



■(6-(l + l/V7)) ^ 



w 



{n"^l^ + A^-i/2)2/3 

as n, iV — )• oo, and the scahng of the large eigenvalues Ai > A2 > ■ ■ ■ of X'^X 

1 1 



^/^^ (^-1/2 + iV-l/2)4/3 



Aa 



n 



N)' 



-'.Yu 



it is proved that {1^} form a well defined parameter dependent state. The latter is again 
specified by the smallest eigenvalues of the stochastic Airy operator, but now with the eigen- 
functions satisfying the boundary condition = wil){0). Furthermore, it is shown that the 
distribution function for the largest eigenvalue, F^^w^x) say, is the unique bounded solution to 
the boundary value problem 



dF 2 d'^F 
dx (5 dw"^ 
F{x,w) -)■ 1 
F{x,w) -> 



+ [x — w 



dF 







dw 

as x,w ^ 00 together 

as w ^ —00 with X < xo < 00, 



;i.5) 



where xq is fixed. An essential step, following [23l [9] is to use Householder transformations to 
reduce X^ to the N x N bidiagonal form 



VbX^n 

XP{N-1) X/3(n-l) 

Xp{N-2) X/3(n-2) 

X/3 



Xl3{n-N+1) 



1.6) 



with [3 = 1. Here xli refers to the particular gamma distribution V[n/2,2] (in general r[s, a] is 
specified by the PDF. proportional to x''~^e~^/'^, x > 0), it has been assumed for definiteness 
that n > N and some zero columns which do not effect the non-zero eigenvalues of X^X have 
been removed. 

It is the Pfaffian point process characterisation of the null case that is generalised in [20] . 
Here knowledge of the joint eigenvalue PDF in the finite system is essential. With even, this 
is shown to be proportional to 



N 



(n-Af-l)/2 A,/2 



n 

^<i<k<N 



Afc) 



N 



6-1 \-i/2 



for r a simple closed contour enclosing the branch points of the integrand. Next, using integra- 
tion methods based on skew orthogonal polynomials (see [T¥| Ch. 6]) a Pfaffian formula is given 
for the correlations, and the distribution of the largest eigenvalue is expressed in terms of the 
corresponding Fredholm determinant. (Actually, the working of [20j applies to the finite system 
only; in the abstract of that work we are told the asymptotic analysis will follow shortly.) 
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The starting point of [20] is the expression 
1 ^ r 

1 TT^(n-A.-l)/2^_A,/2 / ^-lTr(OX-XO-E-) (qT^q) ^ ^^ g) 

CfJl JoiN) 

where C is the normahzation, for the eigenvalue PDF of real Wishart matrices. In the case 
of complex Wishart matrices, the corresponding formula involves an average over U{N) rather 
than 0{N). According to the well known Harish-Chandra/ Itzykson-Zuber formula (see e.g. [HI 
Prop. 11.6.1]) an evaluation in terms of determinants is possible. However, until [20], it was 
not known that the 0{N) matrix integral admitted a tractable evaluation. 

1.2 An alternative viewpoint and outline 

We have seen that two seemingly distinct viewpoints have led to the quantification of the pa- 
rameter dependent state formed at the spectrum edge for spiked real Wishart matrices in the 
critical regime. In this paper we will emphasize a third viewpoint. The idea, initiated in [16], is 
to consider the spiking as a perturbation, and to focus attention on the joint eigenvalue distri- 
bution of the perturbed and unperturbed matrices. This follows naturally from the recurrence 

XX^ = XX^ + hxx^ (1.9) 

for nx N matrices X specified by (11. ip and (II. 2p . where X is an n x (A^ — 1) standard Gaussian 
obtained from X by deleting its first column. 

We will use this formalism to give an alternative construction of the spiked Wishart /3- 
ensemble, specified as the random matrices B'^Bp, with i?^ the N x N bidiagonal matrix (II. 6p . 
This in turn relies on knowledge of the eigenvalue PDF for the Wishart /3-ensemble as specified 
in terms of (II. 6p . We begin in Section 2 by showing how to deduce the eigenvalue PDF (I1.7P 
from the bidiagonal matrix (II. 6p with /3 = 1. Our derivation applies for all /3 > 0, so we are 
able to give the /3 generalization of (II. 7p . This is given in (12. ip below. 

In Section 3 we use known results from [TB] to compute the joint eigenvalue PDF of the 
non-zero eigenvalues of the random matrix pair {XX^ , XX"^), as related by (II. 9p but with 
XX'^ replaced by diagXX^ , in the case that the non-zero eigenvalues of XX'^ have PDF 
proportional to 

N-l 

1=1 l<j<k<N-l 

Results from [TH] tell us that this joint eigenvalue PDF can be realized as the zeros of two 
polynomials generated recursively from a three term recurrence. We give the explicit form of 
the matrix eigenvalue problem implied by the recurrences. Although different to the tridiagonal 
matrix eigenvalue problem for B'^Bp, it similarly involves only 2A^ — 1 independent entries. 

We also take up the problem of integrating over the eigenvalues of XX^, with the aim of 
showing that the non-zero eigenvalues of XX^ have the same PDF as found in Section 2 for 
the spiked Wishart /3-ensemble, thus providing an alternative construction of this ensemble. 
We remark that this construction also allows for a /3-generalisation of the general variance 
Wishart ensemble, and the corresponding eigenvalue PDF. From the latter, for the special 
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value f3{n — + l)/2 — 1 = 0, it is possible to show that the probability of no eigenvalues in 
(0, s) has a simple exponential distribution. 

With n < N, XX^ no longer has any zero eigenvalues. This setting is studied in Section 4. 
In the case /3 = 4, and for a special n we obtain a joint eigenvalue PDF proportional to 

AT 

l<i<j<N i,j=l 

subject to the interlacing 

xi> yi> X2> y2> ■ ■ ■ > xn > Vn (1-12) 

The corresponding parameter dependent soft edge correlations were calculated as a Pfaffian in 
|15j . By the universality results of [8] these same correlations must hold for all cases of the 
/9 = 4 spiked Wishart matrices (i.e. for all choices of n and in (11.61) provided they both 
go to infinity). Moreover, results from [3] tell us that the scaled distribution of the largest 
eigenvalue with PDF (11.121) can be written in terms of the same member of the Lax pair for 
the Hastings-McLeod solution of the Painleve II equation as known for the complex case pj. 
And universality tells us that this result must persist for all cases of the /3 = 4 spiked Wishart 
matrices. An alternative derivation of this fact was given by Bloemendal and Virag p], who 
showed that the distribution satisfies (11.51) . Our results of this section extend the results of 
Wang [25j, who considered a particular value of the parameter only. 



2 Eigenvalue PDF for the spiked Wishart /3-ensemble 

By the spiked Wishart /3-ensemble we refer to the tridiagonal matrices BjBj^, with i?^ the NxN 
bidiagonal matrix (II. 6p . For /3 = 1,2 and 4 we know that this tridiagonal matrix corresponds 
to a unitary similarity transformation of the spiked real, complex and real quaternion Wishart 
matrices, and so shares the same eigenvalue PDF. Here we seek the eigenvalue PDF of B^Bp 
for general /3 > 0. In the case & = 1, this has been done in [9j. We can adapt the workings of 
that calculation to the general 6 > case. 



Proposition 2.1 The tridiagonal matrix B^Bp has eigenvalue PDF proportional to 



N 



/3(„-7V+l)/2-l 



n 

l<j<k<N 



- Ai 



oo 



Jt 



N 



it 



-f}/2 



dt. 



(2.1) 



Proof. Let us write 



Xn 



(2.2) 



Then, according to the definition (II. 6p . the probability measure P{Bp){dBp) has, up to pro- 
portionality, the factorization 



(p{Bp){dBf,) 



,{l-l/b)xl/2 



6=1 



(2.3) 
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Let us denote by {Xj}j=i js[ the (ordered) eigenvalues of Bp, and by {qj} the first component 
of the corresponding (normahzed) eigenvector. The working of |9] (see also [HI proof of Prop. 
3.10.1]) tells us that in terms of these variables P(i?^)((ii?^) |fe=i is proportional to 



N 



N 



N 



j=l l<j<k<N i=l j=l 

where S{-) denotes the Dirac delta function. Furthermore, if we write 

&Af-l dN-l bN-2 



&2 0,2 bi 

bi ai 



then we see from (12. 2 p that = x^. But we also know [HI proof of Prop. 1.9.3] that 
a^v = J2^=iQj^j- Hence, substituting x"^ = Yl!j=iQ^^i (12. 3 p we see from (12. 4 p that our 
remaining task is to compute 



I- N N 



{dq). 



(2.5) 



i=i j=i 
Introducing the integral form of the delta function 

N 



5(yq^^-{]= hm - r e^^U^-^)^^ dt 

and supposing temporarily that 6 < 1 so the coefficient on the exponential in (II. 6p is negative, 
we see that (12. 5p is equal to 



1 

2^ 



Evaluating the integral, up to proportionality this reduces to 



oo 



N 



Jt 



Y[ ( It 



6 - 1 . \ 



"2r^^ 



dt. 



(2.6) 



and we see furthermore that the restriction to 6 < 1 can now be relaxed. Multiplying the 
eigenvalue dependent factors of (12. 4p with (12. 6p gives (12. ip . □ 



3 An alternative construction of the spiked Wishart p- 
ensemble 

3.1 Joint eigenvalue PDF for {XX^,XX^) 

We begin by giving the derivation of the recurrence (ll.9p . With X distributed as in (II. ip . 
set X = YY}!'^. We see that Y^Y is then distributed as a real Wishart matrix with variance 
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matrix equal to the identity (S = In). With S as in (11 ■2p it then follows that each element 
in the first column of X has distribution N[0, \/b] (i.e. is a zero mean, standard deviation \/b 
Gaussian), and all other elements are distributed independently as N[0, 1]. Hence the matrix 
product XX^ can be factorized according to the RHS of (II. 9p . An analogous factorization 
holds for X having complex elements (/3 = 2) or real quaternion elements {(3 = 4). Moreover 
in each case, by the invariance of the distribution of a Gaussian vector under conjugation by a 
unitary matrix, we have that 

ePDF XX^ = ePDF (^diag (XX^) + bxx^^ (3.1) 

where with := (the square of the entries of x), we have that each {qi} is distributed 
according to the gamma distribution r[/3/2, 2]. The notation ePDF in (13. ip refers to the 
eigenvalue PDF. 

For definiteness, let us suppose that n > N. Then XX^ will have n — N+1 zero eigenvalues, 
with the PDF of the remaining N — 1 eigenvalues given by (ll.lOp [HI eq.(3.16)] (in the case 
/9 = 4 of real quaternion entries, all eigenvalues are furthermore doubly degenerate). We take 
up the the problem of computing the joint distribution of the eigenvalues of XX"^, and the 
eigenvalues of XX^, with the two matrices related by (13. ip . 

Let the non-zero eigenvalues of XX"^ be denoted by {yi}i=i,...,N-i- It is a simple exercise to 
show that the secular equation for the eigenvalue problem implied by (13. ip is 

N-l 

O-l + K-f + E^). (3.2) 

where qj = T[P/2, 2] {j = 1, . . . , N - 1) and go = r[/3(n - + l)/2, 1/2]. Furthermore, we know 
from [161 Cor. 3] that the PDF of the roots of this equation, and thus the conditional PDF of 
the non-zero eigenvalues {xj}j=i^...^Ar of XX^, is proportional to 

N N~l 

n-N+l)/2-l xj/{2b) TT -/Sin-N+2)/2+l y^/2b 



JJ^^.(n-iV+l)/2-lg_.,/(2.)-Q^- 



e 

j=i 1=1 

>< n t'-7.?-' nnV.-^.r-'. (3.3) 

\.Li<j<k<N-i\yj yk) j^]^ 

subject to the interlacing (11.120 with := 0. Our sort result can now be deduced. 

Proposition 3.1 Let XX^ andXX'^ he related by liS. and suppose that the non-zero eigen- 
values of XX^ are denoted {yi}i=i^...^N-i o-nd have PDF given by U.10\) . With {a;j}j=i...._Ar 
denoting the non-zero eigenvalues of XX'^ , we have that the joint eigenvalue PDF of both sets 
of non-zero eigenvalues is proportional to 

N N-l 
JJa;^('^-^+l)/2-lg-x,/2b -Q ^-(l-l/b)yi/2 

i=i 1=1 

N N-l 

X n (y^-y^^^ n i^^-^k)l[l[\x.-y,f^'-\ (3.4) 

l<j<k<N-l l<j<k<N i=l j=l 

subject to the interlacing ^1.1^) with y^ := 0. 
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Proof. The joint PDF is given by the product of the conditional PDF for {xi}i=i^,,,^N given 
{yi}i=i^,,,^N-i, times the PDF of {yi}i=i,...,N~i- Thus we need to multiply together (13. 3 p and 
([TID]), and dSaD results. ' ' □ 

What is the marginal distribution of {a;j}j=i^...^7v? In the cases (3 = 1, 2 and 4, the construc- 
tion (13. ip is equivalent the distribution of X^X being given by the spiked Wishart distribution 
(II. ip and (II. 2p . Furthermore the non-zero eigenvalues of XX^ are the same as the non-zero 
eigenvalues of X^X. Hence it must be in these cases at least, {xi}i=i^,,,^N has PDF (12. ip . We 
would like to show that this remains true for general /3 > 0. Our task then is to integrate over 
{yi}i=i,...,N-i in (13. 4p . and show that (12. ip results. This can be accomplished by the use of Jack 
polynomial theory flM Ch. 12&13]. But before taking on this task, we will make note of a 
realization of (13. 4p in terms of a generalised eigenvalue problem. 



3.2 Relationship to a bidiagonal generalised eigenvalue problem 



Let 



a,^r[(iV-j)/3/2 + «o + l,2] (j 



^r[j/3/2,2] (j 



N 



..,N-1) aN = VbT[ao + 1,2] 
6^_i4v^r[(iV-l)/3/2,2] 



(3.5) 



where ao := f3{n — + l)/2 — 1, and set too bo := 0. Then results from [161 Section 5.2] tell us 
that with the monic random polynomials {Bj{x)}j=o^,,,^N defined by the three term recurrence 



Bj{x) = {x — aj)Bj_i{x) — hj^ixBj_i{x) {j 



(3.6) 



we have that the joint PDF of the zeros of {Biy{x), B]y_i{x)) is given by (13.41) . 

In general (see e.g. p4]) the recurrence (13.60 is satisfied by the characteristic polynomials 
Bj[x) = det(xMj — Lj) where Lj and Mj are the top j x j blocks of the bidiagonal matrices 



L :-- 



ai 1 
a2 



aN-i 



1 

Oat 



M :-- 



-hi 



1 

-62 



JN-l 



1 



We see from the specification of the entries of (13.50 that the spike h effects only the single entries 
a^r and &iv_i in L and M respectively, which is analogous to how h enters (II. 6p . 

An open problem is to obtain the stochastic characterisation of the soft edge spiked Wishart 
/9-ensemble starting from the generalised eigenvalue problem Lv = XMv. 



3.3 Jack polynomials and hypergeometric functions 

The conditional PDF (13. 3p is a special case of the Dixon- Anderson density [HI eq. (4.11)]. 
Another special case is the conditional PDF for {?/i}j=i,...,Ar-i given {xj}j=i^..._Ar 

T{Nl3/2) Ui<j<k<N-iiyj - Vk) A , , _ , 1/3/2-1 .0 7^ 

(r(/3/2))^ ni<,<.<^(^. - ^.)^-^ l\ l\ ^ 
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subject to the interlacing (11.121) with yi^ := 0. Let this be referred to as DAAr(/3/2). 

Intimately related to (13. 7p are the symmetric Jack polynomials Pk{z; a), where z = (zi, . . . , z^), 
n denotes a partition of length less than or equal to N (we write £(«:) < A^), and a is a parame- 
ter. The Jack polynomials can be specified as the polynomial eigenfunctions of the differential 
operator 



V dzj J a ^ — Zk\ dz,: dzu 



with leading term given by the monomial symmetric function mf^{z) (see [TH §12.6] for more 
details). Thus with the generalised Pochhammer symbol specified by 

(.) _ TT r{u-ij-l)/a + K,) 
^< -n Tiu-{j-l)/a) ' ^'-'^ 



we have [H eq. (12.209)] 

[/3(iV - l)/2]k 



2//3) = —^^J-J^-^{P^(y- 2//3))da,(^/2), (3.9) 



valid for £(k) < - 1. 

Let us define the quantity d'^ as in [HI eq. (12.60)] (it's precise value plays no explicit role 
in the following), and use this in the definition of the renormalized Jack polynomials 

C^{x]a) = — - — P^{x;a). 
The generalized hypergeometric functions based on Jack polynomials are then specified by 

,F(")(ai, ...,a,M,...,b„x) = Y: ^3.10) 

« I'^l- [bi]i ' ■ ■ ■ [bg]), ' 

Important for our present purposes is the fact that [T^, eq. (13.3)] 

oFo^"^(x) = e^f=i"^. (3.11) 

The use of relevance of the generalized hypergeometric functions is see upon multiplying 
both sides of (EH) by 

[/3(Ar-l)/2]i^/^) (2//3)(l-l /l/l _ N 

[(3N/2fJ^ < V2U J) 

and making use of (I3.10p with p = g = 1 on the LHS. On the RHS we first use the fact that the 
Jack polynomials are homogeneous of degree \k\, and so for c a scalar Pk{xc; a) = c^'^^P^ix; 2/13), 
then use (13. lip . We thus obtain the following corollary of (13. 9p . 

Proposition 3.2 For £{k,) < N — 1 and x := (xi, . . . ,xj\f) we have 

(1/6-1) EfJi'% /2\ _ ,f(2//3) 



/ DAiv(/3/2) 



,F^'^'>{^{N - l)/2; /3N/2; (1/6 - l)x/2) 
e(i/fe-i)Ef=i-./2^^^(2//3)^^/2;/3Ar/2; (1 - l/b)x/2). (3.12) 
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Proof. It remains to explain the second line. This follows from a generalisation of the second 
Kummer identity (13.16)], which states that 

,Fi''\a; c; x) = e^U''\F[''\c - a; c; -x). 

□ 

Comparing the explicit form of DAAr(/3/2) (13. 7p with the joint PDF (13.41) . it follows that 
the marginal distribution of {a;j}i=i ...^at is proportional to 

N 

YY^^in-N^m-^^-JlU^^n -Q _ ^^Y^f["'\p/2-PN/2- (1 - l/6)x/2).(3.13) 

j=l l<j<k<N 

Comparison of (I3.13p with (12. ip shows that our remaining task is to show that for c a scalar 

/°° ^ / \ —13/2 

e''Y[(it-cXj) dt. (3.14) 



For this purpose, we begin by observing from ( ]3.10p and (13.80 that in general iFI;'^^^\i3 /2; b 



x] 



is very special. Thus the only partitions giving a non-zero contribution to the sum (I3.10p are 
of the form {k,0^~^), and so the summation is one-dimensional. In the case b = /3N/2, as 
is the case in (13.141) there is a further special feature, relating to the particular generalized 
hypergeometric function based on two sets of variables [131 eq. (13.20)] 

,4'^'\x; y) := V ^-"^(^)^-"^(^) , (3.15) 

where x := (xi, . . . , x„) and y := (yi, . . . , yn). To see the relation, note that for k = {k, 0^^^) 
we have 

[1/a]^ 

(for the second formula see e.g. [261 ^q. (243)]), while for n with two or more non-zero parts, 
Cit^\y, 0^"^) = 0. Thus the summation over k in (I3.15P is also one-dimensional, and moreover 
we have that 

,Fi'^'\p/2;PN/2;cx) = (^,0^"^)). (3.16) 

We remark that an alternative derivation of the marginal distribution being given by (I3.13P 
with the substitution (I3.16P can be given by using the recursive integration formula [17j (see 
also [161 Appendix C]) 



Qj^'^^^\{x}i=i^,,,^N'-, {z}i=l,...,N) 



= e^-S.--^ (e-^-S.-"i '\n'^^\{y^h=l,...,N-l■, {z^h=^,...,N-l))DA^i2m. (3.17) 

Also, as noted in [26j, there is a further alternative derivation in the cases /3 = 1, 2 and 4. 
Thus with (WdU) denoting the normalized Haar volume form for unitary matrices with real 
(/3 = 1), complex {/3 = 2) and real quaternion {/3 = 4) entries, and H, H^^^ Hermitian matrices 
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with elements from the same field as U, eigenvalues {Aj}, {A^- }, we know 
eq. (13.146)]) 

ygTr(//(0)f/t/,C/)(^t^f/) = ^J-(2//^)(A(0); A). 

This combined with (11. 8p and its /3 = 2 and 4 analogues gives the result. 
Now it has been shown by Wang in [261 Appendix A] that for /3 even 

r ^ 1 

,4"^^\x; (c, 0^-1)) cx / e^- n 7 (3.19) 

Jc l""^ ^j) ' 

where C is a simple closed contour encircling {xj}. Supposing temporarily that c < and 
{xj}j=i„,,^N being in the right half plane allows C to be taken to run along the imaginary axis 
from —ioo to zoo, then be closed as an infinite half circle in the right half plane. But under the 
assumption that c < there is no contribution to the integral along this portion of the contour, 
due to the integrand vanishing exponentially fast. Hence 

/too -I 

and furthermore we can drop the restrictions on c and (3 by analytic continuation (the latter 
requires analytic continuation off the integers; for this we use Carlson's theorem — see e.g. [HI 
Prop. 4.1.4]). 

A useful check on (13.201) is to consider that case = 1. It follows immediately from the 
definition (13.151) that in this case Qj^lf^'^\x] c) = e^^. To reclaim this from (I3.20p we suppose 
temporarily that < [3/2 < 1. Then we an change variables w ^ w + x to deduce that 
qJ^'^^^\x\c) oc e^'^ as required (the restriction < /3/2 < 1 can be removed by analytic 
continuation). We remark that a more complicated formula than (13.201) has been given in [26l 
Appendix A] for the continuation of (I3.19P for general /3 > 0. A crucial difference is that the 
latter formula involves the contour C, whereas the contour in (I3.20p is along the imaginary axis. 

Substituting (I3.20p in (I3.16p . we see that (I3.14p holds as required, and thus {xj}j=i^,„^N as 
implied by the roots of the equation (13. 2p indeed realise the eigenvalue PDF for the spiked 
Wishart /9-ensemble. 



that (see e.g. [TU 
(3.18) 



3.4 Multiple spikes 

Our use of (13.10 has been to perturb the eigenvalue PDF given by (11.100 . As an extension, we 
can construct a family of random matrices {X^^-^^X^-^^^}, j = 1, . . . ,n, dependent on parameters 
{bi, . . . ,bj} by the rewrite of (13.10 

ePDFX^^'^X^^')^ = ePDF (^diagX^^'^^^X^-'-^)^ + bjxx'^y (3.21) 

From the discussion of the first paragraph of §3.1, for /3 = 1,2 and 4 we must have that 
ePDF X'^-'^X^-'^^ is proportional to the known eigenvalue PDF for general variance Wishart 
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matrices with real (/3 = 1), complex (/3 = 2) and real quaternion (/3 = 4) elements. According 
to (11 -Sp (appropriately generalized for /3 = 2 and 4), and (13.181) . the latter is proportional to 



N 



Hxf-'^^'^/'^-'e-'^/' n (A,-A.)Vi^/^^((&-l)/2&;A), (3.22) 

j=l ^<j<k<N 

where (6 — l)/26 := {{bi — l)/26i, . . . , {bjy — l)/2h^). This functional form was proposed very 
recently by Wang [26] as a natural /3-generalisation of the eigenvalue PDF for the general 
variance Wishart matrices. We can use (I3.2ip to give a random matrix realization. 

Thus for general /3 > the conditional PDF (13. 3 p gives a recurrence for ePDFX(^)X(^)^. 
The recursive integration formula (I3.17P tells us that (I3.22p is the solution of this recurrence, 
and thus we can realize (I3.22p as the eigenvalue PDF for this recursively constructed random 
matrix ensemble. 

Although we have emphasized soft edge scaling in the Introduction, it is worth remarking 
that in the case /3(n — + l)/2 — 1 = of (I3.22p (i.e. when the factors of powers of the \j are 
not present), there is a very simple formula for the probability of no eigenvalues in (0, s). The 
latter corresponds to the hard edge gap probability, Epi^; (0,s)) say. According to (I3.22p . in 
this setting 

E^(0; (0,s)) 

-| noo poo 

= rfAi---/ rfA^e-^f-V2 -Q (A,.-Afc)Vr''^((&-l)/2&;A), 

l<j<k<N 

POO POO 

/ dX,--- dX^e-^^-^'^/' n (A,-A,)Vo'^''^((&-l)/2&;A + s), 

l<j<k<N 

where X + s := {Xi + s, . . . , Xjy + s). But we know that [S] 

4'^^\{b - l)/26; X + s) = e^Ef..(f'.-i)/26,^^(2//3)((^ _ ^^/26; A) 



-Ns/2 foo 

~c 



c 

and hence 



Ep{{]- (0, s)) = e-^Sf=i(i/2fe.). (3.23) 

In the case P = 2 (general variance complex Wishart matrices) the result (13.230 has been 
derived previously |T2] 



4 The case /3 = 4 

The conditional PDF (13. 3p holds in the case n > N. If instead n < N the eigenvalues };=!,..., at 
of XX'^ will all be strictly positive. With the eigenvalue PDF of XX'^ determined by (13. ip . 
the corresponding secular equation reads 

N 

= l + 6y^^ (4.1) 
where = r[/3/2, 2] (j = 1, . . . , A^) (cf. E^D- 



12 



Let the roots of (14. ip and thus the eigenvalues of XX'^ be denoted {xi}i=i^,,,^N- Then [161 
Cor. 3] gives that the conditional PDF of {xi}i=i^,,,^N given {?/j}j=i,...,Ar is proportional to 



N N 



ni<i<fc<Af(^j ^fc 

subject to the interlacing (II. 121) . Let us suppose now that {i/j}j=i,...,Ar has PDF proportional to 

N 

We '" n ^y^-y^f- (4-3) 

This is realized by the eigenvalue PDF of the N x N matrix B'^Bp^ with i?J given by (11. 6p in 
the case that 6 = 1 and 

n = N -1 + 2//3. (4.4) 

Multiplying together (14. 2 p and (14.30 shows that the joint PDF of {xi}i=i^,,,^N and {?/j}i,...,7v is, 
up to normalization, given by 

AT 

e-E- .te+(X,-..)/.)/2 -Q _ ^.^^y^ _ -Q _ y^^P/2-1^ ^^^^^ 

l<i<j<N i,j=i 

subject to the interlacing (14. 3p . In the case (3 = 4 this reduces to (II. lip . 

The results of [8] tell us that for a given (3 the parameter dependent soft edge state is 
independent of the ratio n/N, provided both n,N ^ oo. Thus in studying the state we 
are free to choose a particular dependence of n on N, which we take to be (14. 4p . In [12] 
the correlations corresponding to (14.51) with /3 = 4 have been given in terms of a quaternion 
determinant (Pfaffian) with explicit entries. The correlations were computed in the so-called 
parity blind case, when the two species implied by (14.50 — the x's and the y's — are regarded 
as indistinguishable, and the parity aware case when they are not. In the parity blind case the 
soft edge scaled limit was also computed. 

We will discuss first not the correlations, but the distribution function for the largest eigen- 
value. The largest eigenvalue belongs to species x, so we can equally as well work with parity 
aware, species x case, or the parity blind case; our approach relies on working with the latter.. 
In particular, in the special case 6 = 2, /3 = 4, we see that the parity blind system implied 
by (11.111) is precisely the Laguerre orthogonal ensemble with 2N eigenvalues and the weight 
function e~'^/^, specified by the eigenvalue PDF 

^ 2N 

^U'^ixi) n i^^-^^i' (4.6) 

1=1 l<j<k<N 

with w{\) = e~^^^ (A > 0), and /3 = 1. For the PDF (14. 6 p in general, let us denote by 
-E'2iv,^(0; J;w{x)) the probability of no eigenvalues in the interval J. We know from [13] that 
for the Laguerre orthogonal ensemble this probability (which is the distribution function for 
the largest eigenvalue) admits the soft edge scaling limit 

lim E2Ni{0; (16A^ + 4(4Ar)i/3s,oo);e^^/^) 
= \im Eni{0;{AN + 2{2Ny/h,oo)]e-^/^) = ElfJO-,{s,oo)) (4.7) 

N—^oo ' ^ 
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where, with q{x) the transcendent specified by (11.31) . 



E^°i\(0; (s,cx)))j =exp(- {x - s)q^{x) dx - I q{x) dx) (4.8) 



is the square of the distribution function for the scaled largest eigenvalue in the Gaussian 
orthogonal ensemble (GOE; see [HI eq. (9.127)]). Hence with 6 = 2 the distribution of the scaled 
largest eigenvalue in the spiked real quaternion Wishart ensemble is equal to the distribution 
of the scaled largest eigenvalue of the GOE, a fact first deduced in [25] using more complicated 
workings. 

Known results can also be used to specify the distribution function for the scaled largest 
eigenvalue of the /3 = 4 parity blind system for general b in the scaling regime about 6 = 2. 
Specifically, set 

2^/^w 

6 = 2-^, weR, (4.9) 

and let us denote by i?^^^'^'^(0; J; b) the probability that there are no eigenvalues in the interval 
J for the spiked Wishart /9-ensemble with parameter b given by (14. 9 p and n given by (14. 4p . We 
have just seen that 



E^4'''^(0; (s, oo); 6) = ^2Af,4(0; (s, oo); e 

' ui=0 



-A/4N 



and that this in turn permits the scaling limit (14.71) . For general G M we read off from [H 
Th. 7.1] the scaling limit 

lim Ejf ''(O; {IQN + A{ANy/h, oo); 6) = F^{x; w). 

N—>oo ' 

The distribution function is specified in [3] in terms of a Riemann-Hilbert problem for- 
mulation of the Hasting-Macleod solution q{s) i\1.3\\ of the Painleve II equation, or equivalently 
in terms of one member of a Lax pair for q{s). After a slight rewrite [8], the latter reads 

±(f] = ( -m-q'\(f 

dw \g J \-wq + q' - s - q^ J \g 
subject to the initial conditions 

/(s,0) = (7(s,0) = E(s), E(s):=exp(-^ q{t)dt). (4.10) 

Introducing too the notation 

F(s)=exp(^-^ {t~ s)q^{t)dt^ 

the result of [3] is that 

F^{x- w) = \ ((/ + g)E-^'' + (/ - g)E'l'^)F^'\ (4.11) 



14 



Note from fl^TOD . fliTTT]) and (gS]) that we have 



F^(x;0) = E^°i\(0;(s,oo)), 

as is consistent with (14. 7p . 

Next we turn our attention to the soft edge correlation functions for the (3 = 4 case of 
(II. lip . For the finite system, in [12] these have been given in terms of a quaternion determinant 
(Pfaffian) with exphcit entries for both the parity bhnd and parity aware cases. However, these 
are rather lengthy, so we will focus attention on the simplest of the correlations, p(i)(x), which 
corresponds to the density. Let us use the superscripts "a" and "b" to denote aware and 
blind respectively, and furthermore distinguish the two possible aware species {xi} and {yi} by 
writing a,x and a,y respectively. In the notation of [15], for the finite system we have 



P?i)(^) = lf'{x/2,x/2), pl^ix) = Pl^ix) + pl^^ix) (4.12) 

(the factors of 2 in the first equation are due to the difference in scale of (II. lip relative to 
[T5l eq. (1.4)]). Now, by construction of (II. lip , {yi} are distributed according to the Laguerre 
symplectic ensemble (14. 3 p with weight e~^^'^, and so are independent of the parameter b. Known 
results for the scaled soft edge correlations in this ensemble [HI eq.(7.117)] give 

= -K-f*(X,X)--Ai(X) / Aiit)dt, (4.13) 

where 

poo 

ir^°^*(X,F) := / Ai{u + X)Ai{u + Y)du (4.14) 



is the Airy kernel. Also, from the first equation in (I4.12P we read off from [15, eq. (4.27) with 
a = w] that 



p\)(X) := lim AiANy/^p'^fJlQN + AUNy/^X) 

= -K'°^\X,X) -- [ e'"^^~'^/^^K''°^\t,X)dt, 
2 2 oX 



_^ r rfte'"(^-*)/2 r du^K^°'\u,t). (4.15) 

4 J-oo Jx 

As a check, we see from (I4.14p and (I4.15P that 

pl^iX) = K^°'\X, X) + \m{X) r Ai{t) dt. 

This is precisely the /9 = 1 soft edge scaled density for the GOE (see e.g. [HI eq. (7.147)]) as 
is consistent with (14. 7p . 
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